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We study how the adoption of an evaluation mechanism with sharing and 
memoization impacts the class of functions which can be computed in polynomial 
time. We first show how a natural cost model in which lookup for an already 
computed value has no cost is indeed invariant. As a corollary, we then prove 
that the most general notion of ramified recurrence is sound for polynomial time, 
this way settling an open problem in implicit computational complexity. 
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1 Introduction 


Traditionally, complexity classes are defined by giving bounds on the amount of resources 
algorithms are allowed to use while solving problems. This, in principle, leaves open the 
task of understanding the structure of complexity classes. As an example, a given class of 
functions is not necessarily closed under composition or, more interestingly, under various 
forms of recursion. When the class under consideration is not too large, say close enough to 
the class of polytime computable functions, closure under recursion does not hold: iterating 
over an efficiently computable function is not necessarily efficiently computable, e.g. when 
the iterated function grows more than linearly. In other words, characterizing complexity 
classes by purely recursion-theoretical means is non-trivial. 

In the past twenty years, this challenge has been successfully tackled, by giving restricted 
forms of recursion for which not only certain complexity classes are closed, but which pre¬ 
cisely generate the class. This has been proved for classes like PTime, PSpace, the polyno¬ 
mial hierarchy PH, or even smaller ones like NC (more information about related work is in 
Section ini). A particularly fruitful direction has been the one initiated by Bellantoni and 
Cook, and independently by Leivant, which consists in restricting the primitive recursive 
scheme by making it predicative, thus forbidding those nested recursive definitions which 
lead outside the classes cited above. Once this is settled, one can tune the obtained scheme 
by either adding features (e.g. parameter substitutions) or further restricting the scheme 
(e.g. by way of linearization). 

Something a bit disappointing in this field is that the expressive power of the simplest (and 
most general) form of predicative recurrence, namely simultaneous recurrence on generic 
algebras is unknown. If algebras are restricted to be string algebras, or if recursion is not 
simultaneous, soundness for polynomial time computation is known to hold [iiS. The two 


soundness results are obtained by quite different means, however: in presence of trees, one 
is forced to handle sharing [l5| of common sub-expressions, while simultaneous definitions 
by recursion requires a form of memoization [20l |. 

In this paper, we show that sharing and memoization can indeed be reconciled, and we 
exploit both to give a new invariant time cost model for the evaluation of rewrite systems. 
That paves the way towards a polytime soundness for simultaneous predicative recursion on 
generic algebras, thus solving the open problem we were mentioning. More precisely, with 
the present paper we make the following contributions: 

1. We define a simple functional programming language. The domain of the defined func¬ 
tions is a free algebra formed from constructors. Hence we can deal with functions over 
strings, lists, but also trees (see Section El). We then extend the underlying rewriting 
based semantics with memoization, i.e. intermediate results are automatically tabulated 
to avoid expensive re-computation (Section 0]). As standard for functional programming 
languages such as Haskell or OCamI, data is stored in a heap, facilitating sharing of com¬ 
mon sub-expression. To measure the runtime of such programs, we employ a novel cost 
model, called memoized runtime complexity, where each function application counts one 
time unit, but lookups of tabulated calls do not have to be accounted. 

2. Our invariance theorem (see Theorem I4.I7|) relates, within a polynomial overhead, the 
memoized runtime complexity of programs to the cost of implementing the dehned func¬ 
tions on a classical model of computation, e.g. Turing or random access machines. The 
invariance theorem thus confirms that our cost model truthfully represents the compu¬ 
tational complexity of the dehned function. 

3. We extend upon Leivant’s notion of ramified recursive functions [l^ by allowing dehni- 
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tions by generalised ramified simultaneous recurrence {GRSR for short). We show that 
the resulting class of functions, defined over arbitrary free algebras have, when imple¬ 
mented as programs, polynomial memoized runtime complexity (see Theorem I5.3I1 . By 
our invariance theorem, the function algebra is sound for polynomial time, and conse¬ 
quently GSRS characterizes the class of polytime computable functions. 


1.1 Related Work 


That predicative recursion on strings is sound for polynomial time, even in presence of simul¬ 
taneous recursive definitions, is known for a long time Q. Variations of predicative recursion 
have been later considered and proved to characterize classes like PH Q, PSpace [^, Ex- 
pTime Q or NC 11 1 . Predicative recursion on trees has been claimed to be sound for 
polynomial time in the original paper by Leivant [l^ . the long version of which only deals 
with strings [1^. After fifteen years, the non-simultaneous case has been settled by the 
second author with Martini and Zorzi 0; their proof, however, relies on an ad-hoc, in- 
finitary, notion of graph rewriting. Recently, ramification has been studied in the context 
of a simply-typed A-calculus in an unpublished manuscript 0 ; the authors claim that a 
form of ramified recurrence on trees captures polynomial time; this, again, does not take 
simultaneous recursion into account. 

The formalism presented here is partly inspired by the work of Hoffmann [l^ . where 
sharing and memoization is shown to work well together in the realm of term graph rewriting. 
The proposed machinery, although powerful, is unnecessarily complicated for our purposes. 
Speaking in Hoffmann’s terms, our results require a form of full memoization, which is 
definable in Hoffmann’s system. However, most crucially for our concerns, it is unclear how 
the overall system incorporating full memoization can be implemented efficiently, if at all. 


2 The Need for Sharing and Memoisation 

This Section is an informal, example-driven, introduction to ramified recursive definitions 
and their complexity. Our objective is to convince the reader that those definitions do 
not give rise to polynomial time computations if naively evaluated, and that sharing and 
memoization are both necessary to avoid exponential blowups. 

In Leivant’s system [^ . functions and variables are equiped with a tier. Composition 
must preserve tiers and, crucially, in a function defined by primitive recursion the tier of 
the recurrence parameter must be higher than the tier of the recursive call. This form of 
ramification of functions effectively tames primitive recursion, resulting in a characterisation 
of the class of polytime computable functions. 

Of course, ramification also controls the growth rate of functions. However, as soon as we 
switch from strings to a domain where tree structures are definable, this control is apparently 
lost. For illustration, consider the following definition. 

tree(O) = L tree(S(n)) = br(tree(n)) br(t) = B(t, t) . 

The function tree is defined by primitive recursion, essentially from basic functions. As 
a consequence, it is easily seen to be ramified in the sense of Leivant. Even though the the 
number of recursive steps is linear in the input, the result of tree(S”(0)) is the complete 
binary tree of height n. As thus the length of the output is exponential in the one of its input, 
there is, at least apparently, little hope to prove tree a polytime function. The way out is 
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(a) Explicit tree representation. 



(b) Compact DAG. 


Figure 1: Complete Binary Tree of Height Four, as Computed by tree(S"'’(0)). 


sharing: the complete binary tree of height n can be compactly represented as a directed 
acyclic graph {DAG for short) of linear size (see Figured]). Indeed, using the compact DAG 
representation it is easy to see that the function tree is computable in polynomial time. 
This is the starting point of [l5j |. in which general ramified recurrence is proved sound for 
polynomial time. A crucial observation here is that not only the output’s size, but also 
the total amount of work can be kept under control, thanks to the fact that evaluating a 
primitive recursive definition on a compactly represented input can be done by constructing 
an isomorphic DAG of recursive calls. 

This does not scale up to simultaneous ramified recurrence. The following example com¬ 
putes the genealogical tree associated with Fibonacci’s rabbit problem for n G N generations. 
Rabbits come in pairs. After one generation, each baby rabbit pair (N) matures. In each 
generation, an adult rabbit pair (M) bears one pair of babies. 

rabbits(O) = Nl 9-(0) = Ml b(0) = Nl 

rabbits(S(n)) = b(n) a(S(n)) = M(a(n), b(n)) b(S(n)) = N(a(n)) . 

The function rabbits is obtained by case analysis from the functions a and b, which are 
defined by simultaneous primitive recursion: the former recursively calls itself and the latter, 
while the latter makes a recursive call to the former. The output of rabbits(S"'(0)) is 
tightly related to the sequence of Fibonacci numbers: the number of nodes at depth i 
is given by the i**' Fibonacci number. Hence the output tree has exponential size in n 
but, again, can be represented compactly (see Figure [5]). This does not suffice for our 
purposes, however. In presence of simultaneous definitions, indeed, avoiding re-computation 
of previously computed values becomes more difficult, the trick described above does not 
work, and the key idea towards that is the use of memoization. 

What we prove in this paper is precisely that sharing and memoization can indeed be 
made to work together, and that they together allow to prove polytime soundness for all 
ramified recursive functions, also in presence of tree algebras and simultaneous definitions. 

3 Preliminaries 

General Ramified Simultaneous Recurrence Let A denote a finite (untyped) signatures T 
of constructors Ci,... ,Ck, each equipped with an arity ar(ci). In the following, the set of 
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Figure 2: Genealogical Rabbit Tree up to the Sixth Generation, as Gomputed by 
rabbits(S®(0)). 


terms T(A) is also denoted by A if this does not create ambiguities. We are interested in 
total functions from A” = A x ... x A to A. 

n times 

Definition 3.1. The following are so-ealled basic functions.' 

— For each constructor c, the constructor function f c : —>■ A for c, defined as follows: 

f c(^l j ■ • ■ ; ^ar(c) ) , . . . , 3Jar(c) ) 

— For each 1 < n < m, the (to, n)-projection function 11™ : A™ —>■ A defined as follows: 

(^1 ■ ■ ■ 5 ^ m ) — - 

Definition 3.2. 

— Given a function f : A" —>■ A and n functions gi,... ,gn, oil of them from A™ to A, 
the composition h = f o (gi,...,g„) is a function from A™ to A defined as follows: 

h.{x) = f{gi{x),...,gn{x)). 

— Suppose given the functions ti where 1 <i <k such that for some to, fi : x A" 

A. Then the function g = case({fi}i<i<k) defined by case distinction from {fi}i<i<fc is 
a function from A x A” to A defined as follows: g(ci(x), y) = fi{x, y). 

— Suppose given the functions fj, where 1 < i < k and 1 < J < n, such that for some to, 

ft . j^ar(ci) ^ ^ra ar(ci) functions {gj}l<j<n = simrec{{f{}i<i<k,l<j<n) 

defined by simultaneous primitive recursion/rom {fl}i<i<k,i<j<n ore all functions from 
A X A™ to A such that for x = xi,..., Xsr(ci)> 

gj {c, {x),y) = fl{x,gi{xi,y),...,gi ), y),..., g„ (xi, y),..., g„ y),y) . 

We denote by SimRec(A) the class of simultaneous recursive functions over A, defined as 
the smallest class containing the basic functions of Definition 13.11 and that is closed under 
the schemes of Definition 13.21 

Tiering, the central notion underlying Leivant’s definition of ramified recurrence, consists 
in attributing tiers to inputs and outputs of some functions among the ones constructed as 
above, -with the goal of isolating the polytime computable ones. Roughly speaking, the role 
of tiers is to single out “a copy” of the signature by a level: this level permits to control the 
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fiOAp^'"'^ X A A„ 

f C > An ^ ^ — >■ A„ 

t> Apj X ... X Ap^ 

^Pn case({fi}i<i<fc) > Ap x A ^ Am 

f > Apj X ... X Ap„ 

Am gi > A — Ap^ 

-7 A ar(ci) Aa a 

f ^ > Ap X Am X A — ^ Am p > m 

f o(gi,--- 

gn) > A ^ Am 

simrec{{fl}i<i<k,i<j<n) > Ap x A ^ Am 


Figure 3: Tiering as a Formal System. 


recursion nesting. Tiering can be given as a formal system, in which judgments have the 
form f 0 Apj X ... X ^ A^ for pi,... ,Par{t),'m natural numbers and f G SimRec(A). 

The system is defined in Figure [31 where A denotes the expression Aq^ x ... x A^j, for some 
Qi, ■ ■ ■ ,Qk G N. Notice that composition preserves tiers. Moreover, recursion is allowed only 
on inputs of tier higher than the tier of the function (in the case f = simrec{{fl}l<i<k,l<j<n)^ 
we require p > m). 

Definition 3.3. We call a function f G SimRec(A) definable by general ramified simulta¬ 
neous recurrence (GRSR for short) if Ap^ x ... x Ap^^^j^ —>• A™ holds. 

Remark 3.4. Consider the word algebra W = {e, a, b} consisting of a constant e and 
two unary constructors a and b, which is in bijective correspondence to the set of binary 
words. Then the functions definable by ramified simultaneous recurrence over W includes 
the ramified recursive functions from Leivant \2fi l. and consequently all polytime computable 
functions. 

Example 3.5. 

1. Consider N := {0,S} with ar(0) = 0 and ar(S) = 1, which is in bijective correspondence 
to the set of natural numbers. We can define addition add : Ni x Nj —>■ Nj for i > j, by 

add( 0 , 2 /) = nKy) = y add(S(x), 2 /) = (f s o n^)(a:, add(x, y), y) = S(add(x,y)) , 

using general simultaneous ramified recursion, i.e. {add} = s*77irec({{n}, f s olf^}}). 

2. Let T := (Nl, Ml, N, Mj, where ar(NL) = ar(ML) = 0, ar(N) = 1 and ar(M) = 2. 

Then we can define the functions rabbits : Ni —> for i > j from Section\^ by compo¬ 

sition from the following two functions, defined by simultaneous ramified recurrence. 

a(0) = Ml a(S(n)) = (fjvi o (n^,n^)) (n, a(n),b(n)) = M(a(n),b(n)) 

b(0) = Nl b(S(n)) = (f n o n|) (n, a(n), b(n)) = N(a(n)) . 

3. We can define a function ffleafs : T —^ N &y simultaneous primitive recursion which 
counts the number of leafs in T-trees as follows. 

#leafs(NL) = S(0) #leafs(ML) = S(0) 

^leafs(N(t)) = ^leafs(t) ^leaf s(M(^, r)) = add(^leaf s(Z), ^leaf s(r)) . 

However, this function cannot be ramified, since add in the last equation requires different 
tiers. Indeed, having a ramified recursive function ^leafs : —>■ Ni (for some i > 1) 
defined as above would allow us to ramify fib = ^leafs o rabbits which on input n 
computes the Fibonacci number, and is thus an exponential function. 
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f G F ti fvi 

f{vi, ...,Vk)l 

V c G C ti {. Vi 

fih,.. 

.,tk)iv 

C{ti, ...,tk)i C(vi, . ..,Vk) 


. ,pk) -^r gTZ 

Vz. picr = Vi ra fv 



Vk) 


Figure 4: Operational Semantics for Program {F^C^TZ). 


Computational Model, Syntax and Semantics We introduce a simple, rewriting based, 
notion of program for computing functions over term algebras. 

Definition 3.6. A program P is given as a triple {F,C,TZ) consisting of two disjoint signa¬ 
tures F and C of operation symbols f^ and constructors Ci,..., c„ respectively, and 

a finite set TZ of rules I —)• r over terms l,r G F{FUC, V). For each rule, the left-hand side I 
is of the form f^{pi,... ,pk) where the patterns Pj consist only of variables and constructors, 
and all variables occurring in the right-hand side r also occur in the left-hand side 1. 

We keep the program P = (F,C,TZ) fixed throughout the following. Moreover, we require 
that P is orthogonal, that is, the following two requirements are met: 

1. left-linearity: the left-hand sides I of each rule I ^ r G TZ is linear; and 

2. non-ambiguity: there are no two rules with overlapping left-hand sides in TZ. 

Orthogonal programs define a class of deterministic first-order functional programs, see e.g. 
[1]. The domain of the defined functions is the constructor algebra T{C). Correspondingly, 
elements of T(C) are called values, which we denote hy v,u,... . In Figure S] we present 
the operational semantics, realizing standard call-by-value evaluation order. The statement 
t f V means that the term t reduces to the value v. We say that P computes the function 
f : T(C)^ ^ T(C) if there exists an operation f G F, such that f (ui,..., Ufe) = u if and only 
if f{vi,... ,Vk) fv holds for all inputs Vi G T{C). 

Example 3.7 (Continued from Example 13.51) . The definition o/rabbits from Section\^ 
can be turned into a program Pr over constructors of N and T, by orienting the underlying 
equations from left to right and replacing applications of functions f G {rabbits, a, b} with 
corresponding operation symbols f G {rabbits, a, b}. For instance, concerning the function 
a, the defining equations are turned into a(0) —^ Mp and a(S(n)) —s- M(a(n), b(n)). 

Definition 3.8. For f G SimRec(A), by Pf we denote the program {Ff,Cf,TZf) where: 

— the set of operations Ft contains for each function g underlying the definition of f a 
corresponding operation symbol g. 

— the set of constructors Cf contains the constructors of A; 

— the set of rules TZf contain for each equation I = r defining a function g underlying the 
definition f the orientation I ^ r. 

Notice that due to the inductive definition of the class SimRec(A), the program Pf is finite. 
From the shape of the initial functions and operations (Definition 13.11 and Definition 13.21) it 
is also clear that Pf is orthogonal. 

Terms and Term Graphs Furthermore, we fix a set of variables V disjoint from function 
symbols. Terms over a signature F and V are defined as usual, and form a set T{F,V). A 
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term t is called ground if it does not contain variables, it is called linear if every variable 
occurs at most once in t. The ground terms are collected in The set of subterms 

STs(t) of a term t is defined by STs(t) := {t} if t S V and STs(t) := Ui<i<ar(/) STs(ti) U {t} 
if t = f{ti,... ,tar(/))- A substitution, is a finite mapping a from variables to terms. By ta 
we denote the term obtained by replacing in t all variables x € dom(CT) by a{x). If s = ta, 
we also say that s is an instance of the term t. 

We borrow key concepts from term graph rewriting (see e.g. the survey of Plump 2^ for 
an overview) and follow the presentation of Barendregt et al. Q- A term graph T over a 
signature is a directed acyelie graph whose nodes are labeled by symbols in U V, and 
where outgoing edges are ordered. Formally, T is a triple {N, sue, lab) consisting of nodes N, a 
successors function sue : N —)• N* and a labeling function lab : N ^ if UV. We require that 
term graphs are compatible with T, in the sense that for each node o € N, ii labT(o) = f € jF 
then suCj.(o) = [oi,... ,Oar(/)] and otherwise, if labT(o) = x GV, suCg.(o) = []. In the former 
case, we also write T(o) = /(oi,..., Oar(/)), the latter case is denoted by T{o) = x. We 
define the successor relation on nodes in T such that o P holds iff p occurs in suc(o), 
if p occurs at the i^^ position we also write o P- Throughout the following, we consider 
only acyclic term graphs, that is, when is acyclic. Hence the unfolding [o]t of T at 
node o, defined by [o]t := x if T(o) = x GV, and otherwise [6\t '■= /([oi]tj ■ • ■ > [ofelr) where 
T(o) = /(oi,... ,Ofc), results in a finite term. We called the term graph T rooted if there 
exists a unique node o, the root of T, with o p for every p G N. We denote by T [o the 
sub-graph of T rooted at o. Consider a symbol f G if and nodes {oi,... ,Oar(/)} Q N of T. 
The extension S' of T by a fresh node Of ^ N with S(o/) = /(oi,... ,Oar(/)) is denoted by 
Tl±) {o/ /(oi,... ,Oar(/))}. We write f{Tloi, ... ,rtoar(/)) for the term graph Siof. 

For two rooted term graphs T = (TVt, sucj., laby) and S = {Ns,sucg,\abs), a mapping 

m : Nt —t Ns is called morphic in o G Nt if (i) labT(o) = labs(TO(o)) and (ii) o p 


implies m(o) m{p) for all appropriate i. A homomorphism from T to S is a mapping 
m : Nt Ng that (i) maps the root of T to the root of S and that (ii) is morphic in all 
nodes o G Nt not labeled by a variable. We write T S to indicate that m is, possibly 
an extension of, a homomorphism from T to S. 

Every term t is trivially representable as a canonical tree A(t) unfolding to t, using a 
fresh node for each occurrence of a subterm in t. For t a linear term, to each variable a; in t 
we can associate a unique node in A(t) labeled by x, which we denote by Ox- The following 
proposition relates matching on terms and homomorphisms on trees. It essentially relies on 
the imposed linearity condition. 


Proposition 3.9 (Matching on Graphs). Let t be a linear term, T be a term graph and let 
o be a node of T. 


1. If Aft) T[o then there exists a substitution a such that ta = [o]t. 

2. Vice versa, if ta = [o]t holds for some substitution a then there exists a morphism 

A(t) Tio. 


Here, the substitution a and homomorphism m satisfy a(x) = [m{ox)]T for all variables x 
in t. 


Proof The proof is by induction on t. We hrst proof the direction from left to right. Assume 
A(t) 5'Lo- When t is a variable, the substitution a := {t >-p- [o]s} satisfies ta = [o]s. 
Since m(e) = o, we conclude the base case. For the inductive step, assume t = ffti ,..., tk)- 



f ^ {Ci—\ , ij) ■JJ'ni {Ci , Vj) {C}^ ^ f{v\^ . . . JJ'yj {C]^-\. \ , v) TfL — Tl ~\- ^T-z _ 

(Co./(ii> ■ • ■ 5 ifc)) J|m (Cfe+i, ?;) ^ 

C G C (Ci_i, ti) -l|„j (C'i, fz) TO = Xzz=l N (/('^li ■ • ■ I ^fe): v) G C 

(C'o,c(ii,... ,4)) JIzzz (C'fc,c(t;i,... ,-yfe)) {C, f{vi,... ,Vk)) ij-o {C,v) 

{f{vi,...,Vk),v) f{pi,...,pk) GTZ yi.pia = Vi {C,ra) iD,v) , 

_ (C, fjvi, . ■., Vk)) JJ-m+i (-P U {{fjvi, ■ ■ ■, v) _ ^ ^ ^ 

Figure 5: Cost Annotated Operational Semantics with Memoization for Program {J^,C,TZ). 


Fix i = 1,... ,k, and define mi{p) = m{i-p) for each position i-p in t. By case analysis on the 
nodes of A(ti) one verihes A{ti) ^rnt Thus by induction hypothesis tiai = [n^Js for 

a substitution Ui, where without loss of generality tJi is restricted to variables in ti. Define 
cr := Then ta = /(fieri,4crfc) = /([oi]s,..., [ofc]s) = [o]s, where the last 

equality follows as to is morphic on o. Moreover, from the shape of ai and rrii it is not 
difficult to see that by construction the substitution a and homomorphism m are related as 
claimed by the lemma. 

Now for the inverse direction, suppose ta = [o]s. If f is a variable and thus A{t) consists of 
a single unlabeled node, trivially A(f) S'l.o holds for m the homomorphism which maps 
the root of A{t) to o. Observe that a{t) = [o]s = [TO(e)]s, which concludes the base case. 
For the inductive step suppose t = f{ti,... ,tk), hence S{6) = /(oi,...,Ofe), tiU = [oi]s 
(i = l,...,fc) and thus by induction hypothesis A(ti) [oz]s for homomorphisms toi. 

Define the function m by m{e) := o and m{i-p) := mi{p) for alH = 1,..., /c and positions i-p 
of t. Observe that to is defined on all nodes of A(f). By definition of to one finally concludes 
the lemma, using the induction hypotheses together with the equalities A{t){i-p) = A{ti){p) 
for nodes i-p {i = 1,..., k) of A(f). □ 

4 Memoization and Sharing, Formally 

To incorporate memoization, we make use of a cache C which stores results of intermediate 
functions calls. A cache C is modeled as a set of tuples {f{vi ,..., i'ar(/)); v), where f G T and 
vi,..., Var{f) as well as v are values. Figure [S] collects the memoizing operational semantics 
with respect to the program P = {iF,C,TZ). Here, a statement (C, t) JJ-m {D,v) means that 
starting with a cache C, the term t reduces to the value v with updated cache D. The 
natural number m indicating the cost of this reduction. The definition is along the lines 
of the standard semantics (Figure S]), carrying the cache throughout the reduction of the 
given term. The last rule of Figure 0] is split into two rules (Read) and (Update). The 
former performs a read from the cache, the latter the reduction in case the corresponding 
function call is not tabulated, updating the cache with the computed result. Notice that in 
the semantics, a read is attributed zero cost, whereas an update is accounted with a cost 
of one. Consequently the cost to in (C, t) JJ-m {D,v) refers to the number of non-tabulated 
function applications. 

Lemma 4.1. We have (0,t) -IJ-m iC,v) for some m gN and cache C if and only if t fv. 
Proof. Call a cache C proper if {f{vi,... ,Vk),v) G C implies f(vi,... ,Vk) / v. For the 
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direction from right to left, we show the following stronger claim. 

Claim 4.2. Suppose t ].v and let cache Ci be a proper cache. Then (C' 2 ,f) for 

some m. 

The proof is by induction on the deduction 11 of the statement t f v. 

1. Suppose that the last rule in If has the form 

tijvi /(pi,. ■. ,pfc)-)• r g pia = Vi rafv 
f{ti, ...,tk)lv 

We consider the more involved case where at least one U is not a value. By induction 
hypothesis, we obtain proper caches Dq, ..., Dk with Dq = Ci and JJ-m; 

{Di-i,Vi). By the rule (Split), it suffices to show {Dk, /(ui,..., Vk)) -(In (C 2 ,v) for C 2 
a proper cache. We distinguish two cases. Consider the case (f(vi,... ,Vk),u) G Dk 
for some u. Using that /{ui,... ,Vk) f u implies u = u for orthogonal programs, we 
conclude the case by one application of rule (Read). Otherwise, we conclude by rule 
(Update) using the induction hypothesis on ra f v. Note that the resulting cache is 
also in this case proper. 

2. The final case follows directly from induction hypothesis, using the rule (Constructor). 
For the direction from left to right we show the following stronger claim 

Claim 4.3. Suppose {Ci,t) fj-^ {C 2 ,v) for a proper cache Ci. Then t fv. 

The proof is by induction on the deduction 11 of the statement {Ci,t) JJ-m (C 2 ,v) 

1. Suppose first that the last rule in If is of the form 

— 1; 'JJ'mi ■ • -jVk)) (Ck+l,v) 

• 7 -Jim 

By induction hypothesis, we see ti f Vi, using also that the configurations Ci are all 
proper by the previous claim. As we also have f(vi ,..., Vk) i u by induction hypothesis, 
it follows that some rule f(pi, ■ ■ ■ ,Pk) r £ TZ matches f{vi ,..., Vk). Putting things 
together, we conclude by one application (Function). 

2. The remaining cases where the last rule in If is (Constructor), (Read) or (Update) 
follow either from the assumption that Ci is proper, or from induction hypothesis 
using that by the previous claim the intermediate and resulting caches are all proper. 

□ 

The lemma confirms that the call-by-value semantics of Section[3]is correctly implemented 
by the memoizing semantics. To tame the growth rate of values, we define small-step se¬ 
mantics corresponding to the memoizing semantics, facilitating sharing of common sub¬ 
expressions. 
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Small-Step Semantics with Memoization and Sharing To incorporate sharing, we extend 
the pairs (C, t) by a heap, and allow references to the heap both in terms and in caches. Let 
Loc denote a countably infinite set of locations. We overload the notion of value, and define 
expressions e and (evaluation) contexts E according to the following grammar: 

v.= i \ c{vi,...,Vk); 

e-.= i\ ..., 4), e) I /(ei, ...,ek)\ c(ei,..., e^); 

E :=a\ {f{ii,...,ek),E) I f{£i,... ,ii_i,E,ei+i,... ,ek) \ c^i,..., 4_i, LI, e^+i,..., Cfe). 

Here, £i,...,£k,( & Loc, f & £F and c € C are fc-ary symbols. An expression is a term 
including references to values that will be stored on the heap. The additional construct 
{f{£i,... ,£k),e) indicates that the partially evaluated expression e descends from a call 
f(yi,... ,Vk), with arguments Vi stored at location £i on the heap. A context E is an 
expression with a unique hole, denoted as □, where all sub-expression to the left of the 
hole are references pointing to values. This syntactic restriction is used to implement a 
left-to-right, call-by-value evaluation order. We denote by E[e] the expression obtained by 
replacing the hole in A by e. 

A configuration is a triple {D,H,e) consisting of a cache D, heap H and expression e. 
Unlike before, the cache D consists of pairs of the form {f{£i, ■ ■ ■ ,£k),£) where instead of 
values, we store references £i,... ,£k,£ pointing to the heap. The heap H is represented as 
a (multi-rooted) term graph H with nodes in Loc and constructors C as labels. If f is a 
node of H, then we say that H stores at location £ the value [£]h obtained by unfolding H 
starting from location £. We keep the heap in a maximally shared form, that is, H{£a) = 
c(£i,... ,£k) = H{£b) implies 4 = £b for two locations £a,£b of H. Thus crucially, values are 
stored once only, by the following lemma. 

Lemma 4.4. Let H be a maximally shared heap with locations £\,£’i. If [£i\h = [£ 2 ]h then 

4=4. 

The operation merge(iL, c(£i,... ,£k)), defined as follows, is used to extend the heap H 
with a constructor c whose arguments point to £\,...,£k, retaining maximal sharing. Let 
£f he the first location not occurring in the nodes N oi H (with respect to an arbitrary, but 
fixed enumeration on Loc). For £i,... ,£k & N we define 


merge(iJ, c(£i,... , 4 )) 


{H,£) iiH{£)=c{£i,...,£k), 

(H U{£f 1 -^ c{£i,..., £k)},£f) otherwise. 


Observe that the first clause is unambiguous on maximally shared heaps. 

Figure [ 6 ] collects the small step semantics with respect to a program P = (T,C,Tl). We 
use —T’rsm to abbreviate the relation —U —U —and likewise we abbreviate — 7 .^ U —by 
“^Rrsm- Furthermore, we define —■= “^rsm ’ ' ~^rsm- Hence the m-fold composition 

corresponds to a —>'nrsiii"i'C‘iciction with precisely m applications of —Throughout 
the following, we are interested in reductions over well-formed configurations: 

Definition 4.5. A configuration {D, H, e) is well-formed if the following conditions hold. 


1. The heap H is maximally shared. 

2. The cache D is a function, and compatible with e, that is, if {f{£i,. ■., £k), e') occurs as 
a sub-expression in e, then (f(£i,..., £k), £) ^ D for any £. 
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{f{£i,...,£k)J) ^ D f{pi,...,pk) eTZ T := A{f{pi,... ,pk)) 

T fjHlii,.. .,H[£k) cTra := {x !->■ w(4) I 4 e Loc, r(4) = X e V} ^ 

(i^, i?, , 4)]) -^R {D, H, E[{fiA, 4), ram)]) 


{f{£im--,£k),£)&D 


{D, H, i?[/(4,..., 4)]) (D, H, m) 


(read) 


{D, H, E[{f{A, ..., 4), £)]) {D U {(/(4,..., 4), £)}, H, E[£]) 
(H'J) = merge{H,c{£i,. .. ,4)) 


(store) 


{D, H, E[c{£u. .., 4)]) {D, H', E[e]) 


(merge) 


Figure 6: Small Step Semantics with Memoization and Sharing for Program (E,C,TV). 


3. The configuration contains no dangling locations, that is, H{£) is defined for each location 

£ occurring in D and e. 

Lemma 4.6. 1. If {D,E[,E[e]) is well-formed then so is {D,H,e). 

2 . If {Di,Hi,ei) — S'Rrsm {D 2 , H 2 ,e 2 ) and {Di, IIi,ei) is well-formed then so is (£>2, ^^2, 62). 

Proof. It is not difficult to see that Assertion [T] holds. To see that Assertion [5] holds, fix 
a well-formed configuration {Di,IIi,ei) and suppose (£>i,i/i,ei) -tRrsm (£*2,62). We 
check that (£>2, i?2, 62) is well-formed by case analysis on 

1 . The heap H 2 is maximally shared: As only the relation —modifies the heap, it suffices 
to consider the case (£>i, i?i, ei) — (£>2, 112 , 62 ). Then {Il 2 ,£) = merge(iJi, c(t'i ,... ,£k)) 
for some location £, and the property follows as merge preserves maximal sharing. 

2. The cache D 2 is a function: It suffices to consider the rules —> 3 . As immediate 
consequence of compatibility of Di with ei it follows that D 2 is a function. 

3. The cache D 2 is compatible with 62 '. Only the rules —and —>-3 potentially contradict 
compatibility. In the former case, the side conditions ensure that 62 and D 2 are 
compatible, in the latter case compatibility follows trivially from compatibility of Di 
with Cl. 

4. No dangling references: Observe that only rule introduces a fresh location. The 
merge operations guarantees that this location occurs in the heap ff 2 - 


□ 

From now on, if not mentioned otherwise we will suppose that configurations are well- 
formed, tacitly employing Lemma 14.61 

It is now time to show that the model of computation we have just introduced fits our 
needs, namely that it faithfully simulates big-step semantics as in Figure [5] (itself a correct 
implementation of call-by-value evaluation from Section |3|) . This is proven by first showing 
how big-step semantics can be simulated by small-step semantics, later proving that the 
latter is in fact deterministic. 
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In the following, we denote by {e\H the term obtained from e by following pointers to the 
heap, ignoring the annotations (/(f'l,..., t'fe), •). Formally, we define 

ri _ if e =/(d,..., Cfe), 

\W]h ife= (/(£i,...,4),e'). 

Observe that this definition is well-defined as long as H contains all locations occurring in 
e. Likewise, we set [D]h := 

Our simulation result relies on the following two auxiliary lemmas concerning heaps. The 
first is based on the observation that in -^[jj.g,j-reductions, the heap is monotonically increas¬ 
ing. 

Lemma 4.7. If {Di, iLi, ei) —^-Rrsm {^2, H2, 62 ) then the following properties hold: 

1 - [^]h 2 = [^]hi for every location £ of Hi; 

2 . [Di]h 2 = [Di]hi and = [ei\Hx- 

Proof. As for any other step the heap remains untouched, the only non-trivial case is 
{Di,Hi,E[c{£i,...,£k)]) {Di,H 2,E[£]) with {H2,£) = merge(iLi, c(£i,..., 4 )). Ob¬ 

serve that by definition of merge, H2{£) = Hi{£) for every £ € Nh^- From this Assertion [T] 
is easy to establish. Assertion [5] follows then by standard inductions on Di and E, respec¬ 
tively. □ 

Lemma 4.8. Let (D., E[v]) be a configuration for a value v. Then {D, H, E[v]) —>■* 

{D,H',E[£]) with [£]h> = 

Proof. Note that by assumptions, e consist only of constructors and locations. We proof 
the lemma by induction on the number of constructor symbols in e. In the base case, 
e = £ and the lemma trivially holds. For the inductive step, it is not difficult to see that e = 
E'[c{£i,... ,£k)] for some evaluation context FI', and hence {D, H, E[e]) — {D, H', E[E'[£]]), 
where {H', £) = merge(iF, c(£i ,... ,£k)). Using that {£]h' = [c{£i,... ,£k)\H' by definition of 
merge and Lemma imi^ we conclude We complete this derivation 

to the desired form, by induction hypothesis. □ 

An initial configuration is a configuration of the form ( 0 ,iF, e) with H a maximally 
shared heap and e = /(ui ,... ,Vk) an expression unfolding to a function call. Notice that 
the arguments vi,... ,Vk are allowed to contain references to the heap H. 

Lemma 4.9 (Simulation). Let ( 0 ,iF, e) be an initial configuration. //( 0 , [e]ij) ifm (4 u) 
holds for m > 1 then ( 0 , iJ, e) — G,£) for a location I in G with [£]g = v. 

Proof. Call a configuration {D,H,e) proper if it is well-formed and e does not contain a 
sub-expression (/(ui ,... ,Vk),e'). We show the following claim: 

Claim 4.10. For every proper configuration {Di, Hi, ei), JJ-m (02,v) implies 

{Di,Hi,ei) -^ 4 , ■ {D2,H2,£) with = (4,^). 

Observe that —>rsm ' ^'n/rsm — ~^R/rsm whenever m > 0. Since an initial configuration is 
trivially proper, the lemma follows from the claim. 

To prove the claim, abbreviate the relation —by for all m G N. Below, 
we tacitly employ —• —>."*2 _ _^mi+m 2 j-pj. mi, m2 G N. The proof is by induction on 
the deduction 11 of the statement {[Di]h, Jj-m (C, n). 
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1. Suppose that the last rule in 11 has the form 

C G C — tj) Ij-mi ^ — ^2—1 

(( 1/0 'O'm {Ck i ^('Ui, . . . , U/c ) ) 

Fix a proper configuration (Dq, Hq, cq) unfolding to {Co,c{ti,... ,tk)). Under these 
assumptions, either eg is a location or eg — c(ei,..., e/c). The former case is trivial, 
as then t is a value and thus m = 0. Hence suppose eo = c(ei,..., efe). We first show 
that for alH < fc, 

{Do, Ho, c(ei,..., e/c)) —c(£i, e^+i,..., e/c)) , (f) 

for a configuration (Z?/, Hi, c(£i, e^+i,..., e/c)) which unfolds to the configura¬ 

tion {Ci,c{vi,..., Vi,ti+i ,..., tk)). The proof is by induction on i, we consider the step 
from i to Induction hypothesis yields a well-formed configuration {Di, Hi, £’[e/+i]) 
for E = c{£i,... ,^i,U, ei+ 2 ,..., e/c) reachable by a Derivation ([f]). As the configuration 
{Di, Hi, ei+i) unfolds to ((7/, U+i), the induction hypothesis of the claim on the assump¬ 
tion ((7/,U-|_i) fj-mi-i-i (I 7 /-I- 1 , Uj-i-i) yields (D/, FZj, e/-|_i) y (-Dj_|_ 2 ,Z 7 j_|_i,-Z/-|_i) where 
the resulting configuration unfolds to {Ci+i,Vi+i). As a consequence, its not difficult to 
see that also {Di,Hi,E[ei+i]) {Di+i,H,+i,E[ii+i]) holds. Since [U-i-i]ffi+i = 

Vi+i and [E[ei+i]]Hi = c(ei,..., u/, U+i, U+ 2 ,..., 4 ), using Lemma |4Jl[2]) on the last 
equality it is not difficult to see that = c(ei ,... ,Vi, Ui+i, U+ 2 ,..., tk)- As 

we already observed = Ci+i, we conclude the Reduction ffl- 

In total, we thus obtain a reduction {Do, Ho, c(ei,..., e/c)) —>■"* {Dk, Hk,c{£i,... ,ik)) 
where m = Yli=i {Dk, Hk, c{£i,..., £k)) is a well-formed, in fact proper, config¬ 

uration which unfolds to {Ck,c{vi,... ,Vk))- Employing ——!•* = —we conclude 
the case with Lemma [4.81 

2. Suppose that the last rule in H has the form 

— 1 j ^z) ^rrii {Ci,Vi) {Ck,f{vi,...,Vk)) \£n{Ck+i,v) m = 

{Co, f{ti, •. ■, tk)) JJ-m {Ck+i,v) 

Fix a proper configuration {Do, Ho, eo) unfolding to {Co, f{ti, ■ ■ ■, tk))- By induction on 
k, exactly as in the previous case, we obtain a proper configuration {Dk,Hk, f{£i, - - -, £k)) 
unfolding to {Ck,f{vi,- --,Vk)) with 

(Do,iLo,eo)->^-=i’"^ {Dk,Hk,f{£i,---,£k)) - 

The induction hypothesis also yields configuration {Dk+i, Hk+i,£) unfolding to {Ck+i,v) 
with {Dk, Hk, f{£i, - - - ,£k)) — {Dk+i, Hk+i,£)- Summing up we conclude the case. 

3. Suppose that the last rule in H has the form 

{f{vi,--.,Vk),u) € C 
{C,f{vi,---,vk)) J|o {C,v) 

Consider a proper configuration {D,H,e) that unfolds to {C, f{vi, -. - ,Vk))- Then 
e = /(ei,..., Ck), and using k applications of Lemma [4.81 we construct a reduction 

{D,H,f{ei,--.,ek))^: {D, H^, f{£,,e2, - - - ,ek)) {D, Hk, f{£i, - - - ,£k)) , 
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with {D, Hk,f{£i, ■ ■ ■ ,(-k)) unfolding to (C, f(vi ,..., Vk))- Lemmaand the assump¬ 
tion on C = [D]H^, implies that there exists a unique pair S D with 

.. ■,ek)]Hk = ■.,Vk) and [e]Hk = v. Thus overall 

[D, H, e) = {D, H, /(ei,..., Ck)) {D, /(£i,..., 4)) , 

where {D,Hk,i) unfolds to {C,v). Using —>■* • —C—5>° we conclude the case. 

4. Finally, suppose that the last rule in If has the form 

(/4i, ■. ■ ^ C f{li,...,lk) €n Wi. ha = Vj {C,ra) ij-m {D,v) 

{C,f{vi,...,Vk)) -U-m-hi (£>U{(/(z;i,...,?;fe),-!;)},?;) 

Fix a proper configuration {D, H, e) that unfolds to (C, f(vi ^..., Vk)), in particular e = 
/(ei,..., Cfe). As above, we see (D, iJ, e) —>■* {D, Hk, f{£i ,..., £k)) for a configuration 
{D,Hk,fiii,. ■ .,ik)) also unfolding to {C,f{vi,.. .,Vk)). As (/4i,.. ■,Vk),v) ^ C, we 
have {f{£i ,..., £k),£) ^ Dk for any location £, by Lemma l4^ Since Proposition lS.QI on 
the assumption yields A(/(Zi ,... ,lk)) f{H [li,..., H[£u) for a matching morphism 

TO, in total we obtain 

[D, H, e) (D, H, /(4,..., 4)) {D, Hk, (/(4, • ■ •, 4), ra„)) • 

Note that by Proposition 13.91 the substitution a and induced substitution am sat¬ 
isfy a{x) = [om(x)]Hk for all variables x in t. Hence by a standard induction on 
r, [ram]Hk = rcr follows. We conclude that {D, Hk, {f{£i, ■ ■ ■ :£k),ram)) unfolds to 
(C,ra). Thus the induction hypothesis yields a well-formed configuration {D',G,£) 
unfolding to {C,v) with (D, Hk,ram) {D',G,£). Thus 

(D, Hk, {f{£i,..., £k),ram)) ^^{D', G, (/(4, ■ • ■, 4), £)) 

^3 (7^'u{(/(4,...,4)4)},G,£) . 

Using that [£]g = v and [fi£i ,..., 4)]^^ = /('Ci, ■ • ■, Vk), Lemma |121 yields 

[O' U {(/(4,..., 4), £)}]g = G' U {(/4i, ..., 4} • 

Putting things together, employing —)•* • —j-j; C —and —we conclude 
{D,H,e) (£)' u {{f{£i, ■ ■ ■ ,£k),£)},G,£), where the resulting configuration un¬ 

folds to (C" U {(/(ui,..., Vk), r^)}, v). We conclude this final case. 


□ 

The next lemma shows that the established simulation is unique, that is, there is exactly 
one derivation {0,H, e) ->r/rs. (D, G,£). Here, a relation —>■ is called deterministic on a set 
A if 61 c— a -> 62 implies bi = 62 for all a € A. 

Lemma 4.11 (Determinism). 

1. The relations ——>-3 and —are deterministic on well-formed configurations. 

2. The relation —^Rrsm *5 deterministic on well-formed configurations. 
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Proof. For Assertion [U fix — G {“^rj ~^m}- Let {D,H,e) be a well-formed con¬ 

figuration. We show that any peak {Di, Hi,ei) j.<r- {D,H,e) —(£* 2 ,-^ 2562 ) is trivial, i.e. 
{Di, Hi,ei) = (I? 2 j L^ 2 , 62 ). Observe that independent on —the evaluation context E in 
the corresponding rule is unique. From this, we conclude the assertion by case analysis on 
—The non-trivial cases are —and —= —>j.. In the former case, we conclude 
using that rules in TZ are non-overlapping, tacitly employing Proposition 13.91 The latter 
case we conclude using that D is well-formed. 

Finally, for Assertion [2] consider a peak {Di,Hi,ei) {D,H,e) —(- 02 , 1 ^ 2 , 62 ) 

S {~^Ri We show that this peak is trivial by induction on the 

expression e. By the previous assertion, it suffices to consider only the case —^ 

The base case constitutes of the cases (i) e = f{ii, ■ ■ ■ ,ik), (h) e = ■ ■ ■ ,^fc),I') and 

(iii) e = c{£i,... ,ik). The only potential peak can occurs in case (i) between relations —>- 5 ^ 
and ^j.. Here, a non-trivial peak is prohibited by the pre-conditions put on H. For the 
inductive step, we consider a peak 


{D^,H^,E[e[]) {D,H,E[e']) (D 2 ,iL 2 , A[e']) , 

where e = E[e'] for a context E. As we thus have a peak {D,H,e') — 

(I? 2 , i^ 2 , 62 ), which by induction hypothesis is trivial, we conclude the assertion. □ 

Theorem 4.12. Suppose (0, /(ui,..., Vk)) -IJ-m (C, v) holds for a reducible term f{vi ,..., Vk). 

Then for each initial configuration ( 0 , iJ, e) with [e]H = /('Ci, ■ ■ ■ ,Vk), there exists a unique 
sequence ( 0 ,i?, e) — {D,G,£) for a location i in G with [£]g = v. 

Proof. As /(ui,..., Vk) is reducible, it follows that m > 1. Hence the theorem follows from 
Lemma [4.91 and Lemma [4.Ill □ 

Invariance Theorem 14.121 tells us that a term-based semantics (in which sharing is not 
exploited) can be simulated step-by-step by another, more sophisticated, graph-based se¬ 
mantics. The latter’s advantage is that each computation step does not require copying, 
and thus does not increase the size of the underlying configuration too much. This is the 
key observation towards invariance: the number of reduction step is a sensible cost model 
from a complexity-theoretic perspective. Precisely this will be proven in the remaining of 
the section. 

Define the size \e\ of an expression recursively by \£\ := 1, |/(ei,..., ek)\ := 1 -f 
and ■ ■ ■ ,£k), e)| := 1 + |e|. In correspondence we define the weight wt(e) by ignoring 

locations, i.e. wt(.^) := 0. Recall that a reduction (Di, iJi, ei) {^> 2 , H 2 , 62 ) consists of 

m applications of all interleaved by —>j.s„,-reductions. As a first step, we thus estimate 

the overall length of the reduction (Di,iLi,ei) —>^rsm {D 2 , H 2 ,e 2 ) in m and the size of ei. 

Set A := max{|r| | 1 —>■ r G TZ}. The following serves as an intermediate lemma. 

Lemma 4.13. The following properties hold: 

1. If (Di, iJi, ei) {D 2 , H 2 , 62 ) then wt(e 2 ) < wt(ei). 

2. If (I?i, iJi, ei) —J-pi {D 2 , 112 , 62 ) then wt(e 2 ) < wt(ei) -I- A. 

Proof. The first assertion follows by case analysis on —For the second, suppose (Di , iLi, ei) —>j.sm 
[ 02 , 112 , 62 ) where ei = E[f{£i ,... ,4)] and 62 = E[{f[li ,... ,4),rcrm)] for a rule/(li ,... ,lk) 
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r G TZ. Observe that since the substitution am replaces variables by locations, A > |r| = 
\ram\ > wt(r(Tm) holds. Consequently, 

wt(/(£i, ... ,4)) + A > 1 + wt{ram) = wt((/(£i,.. .,£k),ram)) ■ 


Then the assertion follows by a standard induction on E. 


□ 


Then essentially an application of the weight gap principle 17| , 
analysis, binds the overall length of an —>-^j. 3 „,-reduction suitably. 


a form of amortized cost 


Lemma 4.14. //(Di, iJi, ei) p 2 ,i? 2 ,e 2 ) then {Di,Hi,ei) (T> 2 ,^f 2 ,e 2 ) for 

n < (1 + A) • m + wt(e) and A S N a constant depending only on P. 


Proof. For a configuration c = {D,H,e) define wt(c) wt(e) and let A be defined as in 
Lemma 14.131 Consider {Di, Hi,ei) (^ 2 , ^^ 2 , 62 ) which can be written as a reduction 

(Hi, iJl, ei) = Co —trsm Cl —>rim —^r ' ' ' “>rsm , (t) 


of length n := m + X]i=o Lemma 14.131 yields (i) Ui < wt(ci) — wt(di) for all 0 < i < m; 
and (ii) wt(ci+i) — wt(di) < A for all 0 < i < m. Hence overall, the Reduction ([|]) is of 
length 


n < m + (wt(co) — wt(do)) + • • • + (wt(cm) — wt(dm)) 

= m + wt(co) + (wt(ci) - wt(do)) H-P (wt(cm) - wt(dm_i)) - wt(dm) 

< m + wt(co) + TO • A 
= (1 + A) • TO + wt(e) . 


The lemma follows. 


□ 


Define the size of a configuration |(H,H, e)| as the sum of the sizes of its components. 
Here, the size \D\ of a cache D is defined as its cardinality, similar, the size \H\ of a heap 
is defined as the cardinality of its set of nodes. Notice that a configuration (H, H, e) can be 
straight forward encoded within logarithmic space-overhead as a string \{D, H,e)'], i.e. the 
length of the string \{D,H, e)] is bounded by a function in 0(log(n) • n) in \ {D,H,e)\, using 
constants to encode symbols and an encoding of locations logarithmic in \H\. Crucially, a 
step in the small-step semantics increases the size of a configuration only by a constant. 

Lemma 4.15. If{Di,Hi,ei) ->R,rsm {D2,H2,e2) then \{D2,H2,e2)\ < |(Hi,ili,ei)| -P A. 

Proof. The lemma follows by case analysis on the rule applied in (Hi, TJi, ei) —>Rrsm (^ 2 , H 21 O 2 ), 
using 1 < A. □ 

Theorem 4.16. There exists a polynomial p : N x N —>■ N such that for every initial con¬ 
figuration (0,ili,ei), a configuration {D2, H2, 02) with (0,ili,ei) — iD2, H2,02) is 
computable from (0, Hi, ei) in time P{\Hi \ -P |ei |, to). 

Proof. It is tedious, but not difficult to show that the function which implements a step 
c —>Rrsm '^) i-6- which maps |"c] to [d], is computable in polynomial time in [c], and thus in 
the size |c| of the configuration c. Iterating this function at most n := (I-PA)-to-P|(0, Hi, ei)| 
times on input [(0, Hi, ei)], yields the desired result [(H 2 ,H 2 ,e 2 )] by Lemma 14.141 Since 
each iteration increases the size of a configuration by at most the constant A fLemma l4.15ll . 
in particular the size of each intermediate configuration is bounded by a linear function in 
|(0,Hi,ei)| = |Hi| -P |ei| and n, the theorem follows. □ 
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Combining Theorem 14.121 and Theorem 14.161 we thus obtain the following. 

Theorem 4.17. There exists a polynomial p : NxN ^ N such that for {0,f{vi,... ,Vk)) -IJ-m 
{C,v), the valuer represented as DAG is computable from vi,... ,Vk in timep{^^^.^\vi\,m). 

Theorem 14.171 thus confirms that the cost m of a reduction ... ,Vk)) ifm {C,v) is 

a suitable cost measure. In other words, the memoized runtime complexity of a function /, 
relating input size n S N to the maximal cost m of evaluation f on arguments vi,... ,Vk of 
size up to n, i.e. (0, f{vi,... ,Vk)) JJ-m {C, v) with 1^*1 < n, is an invariant eost model. 

Example 4.18 (Continued from Example 13.71) . Reeonsider the program Pr and the eval¬ 
uation of a call rabbits(S^{0)) which results in the genealogical tree Vn of height n S N 
associated with Fibonacci’s rabbit problem. Then one can show that rabbits{S^{0)) ifm Vn 
with m < 2 • n + 1. Crucially here, the two intermediate functions a and b defined by simul¬ 
taneous recursion are ealled only on proper subterms of the input S"'(0), hence in partieular 
the rules defining a and b respectively, are unfolded at most n times. As a consequence of the 
bound on m and Theorem \4.1'il\ we obtain that the function rabbits from the introduction 
is polytime computable. 

Remark 4.19. Strictly speaking, our DAG representation of a value v, viz the part of the 
final heap reachable from a corresponding location t, is not an encoding in the classical, 
complexity theoretic setting. Different computations resulting in the same value v can pro¬ 
duce different DAG representations of v, however, these representations differ only in the 
naming of locations. Even though our encoding can be exponentially compact in compari¬ 
son to a linear representation without sharing, it is not exponentially more succinct than a 
reasonable encoding for graphs (e.g. representations as circuits, see Papadimitriou f^I ). In 
such succinct encodings not even equality can be decided in polynomial time. Our form of 
representation does clearly not fall into this category. In particular, in our setting it can be 
easily checked in polynomial time that two DAGs represent the same value. 

5 GRSR is Sound for Polynomial Time 

Sometimes (e.g., in 0), the first step towards a proof of soundness for ramified recursive 
systems consists in giving a proper bound precisely relating the size of the result and the 
size of the inputs. More specifically, if the result has tier n, then the size of it depends 
polynomially on the size of the inputs of tier higher than n, but only linearly, and in very 
restricted way, on the size of inputs of tier n. Here, a similar result holds, but size is replaced 
by minimal shared size. 

The minimal shared size ||r!i,..., Ufe|| for a sequence of elements vi,... ,Vk S A is defined 
as the number of subterms in ui,..., Vk, i.e. the cardinality of the set Ui<i<fc STs(ui). Then 
||ui,..., rifcll corresponds to the number of locations necessary to store the values v\,... ,Vk 
on a heap (compare Lemma 14.41) . If A is the expression A„j^ x ... x A„^, n is a natural 
number, and Tis a sequence of m terms, then is defined to be \\ti^,... ,ti,^\\ where 

ii,... ,ik are precisely those indices such that ,...,> n. Similarly for ||tllX"- 

Proposition 5.1 (Max-Poly). // f > A —>• A„, then there is a polynomial Pf : N —>■ N such 
that ||f(i/)|| < ||u||I”+Pf(||?;||A”)- 

Once we know that ramified recursive definitions are not too fast-growing for the minimal 
shared size, we know that all terms around do not have a too-big minimal shared size. As 
a consequence: 
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Proposition 5.2. //f>A^A„, then there is a polynomial Pf : N —>■ N such that for every 
V, (0,/('iT))-IJ-m iC,v), with m < pf{\\v\\). 


The following, then, is just a corollary of Proposition 15.21 and Invariance (Theorem HTT]). 

Theorem 5.3. Let f : x ... x Ap^, —^ Am be a function defined by general ramified 

simultaneous recursion. There exists then a polynomial pf : —>■ N such that for all inputs 

vi,. ■ ■ ,Vk, a DAG representation of f{vi, ,Vk) is computable in time Pf (|ui |,..., |u„|). 

Example 5.4 (Continued from Example 14.18|) . In Exo,mvle \S.5\ vie indicated that the func¬ 
tion rabbits: N —>■ T from Section\^ is definable by GRSR. As a consequence of Theo¬ 
rem fOl it is computable in polynomial time, e.g. on a Turing machine. Similar, we can 
prove the function tree from Section\^polytime computable. 


6 Conclusion 


In this work we have shown that simultaneous ramified recurrence on generic algebras is 
sound for polynomial time, resolving a long-lasting open problem in implicit computational 
complexity theory. We believe that with this work we have reached the end of a quest. Slight 
extensions, e.g. the inclusion of parameter substitution, lead outside polynomial time as soon 
as simultaneous recursion over trees is permissible. 

Towards our main result, we introduced the notion of memoized runtime complexity, and 
we have shown that this cost model is invariant under polynomial time. Crucially, we use 
a compact DAG representation of values to control duplication, and tabulation to avoid 
expensive re-computations. To the authors best knowledge, our work is the first where 
sharing and memoization is reconciled, in the context of implicit computational complexity 
theory. Both techniques have been extensively employed, however separately. Essentially 
relying on sharing, the invariance of the unitary cost model in various rewriting based models 
of computation, e.g. the A-calculus and term rewrite systems 0, [l^ could be proven. 

Several works (e.g. rely on memoization, employing a measure close to our notion 

of memoized runtime complexity. None of these works integrate sharing, instead, inputs are 
either restricted to strings or dedicated bounds on the size of intermediate values have to 
be imposed. We are confident that our second result is readily applicable to resolve such 
restrictions. 
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